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Abstract 

This paper investigates how knowledge workers utilize 
^ the Web to seek external information as part of their 
§ daily work. Thirty four participants from seven 
oo companies were interviewed about their information 

^ needs and preferences. In addition, a custom-developed 
Q software application recorded each participant's Web 
W behavior for a two week monitoring period. To 

understand better the significant episodes of activity 
identified in the Web tracking logs, a second round of 
personal interviews was conducted. The study found that 
people who use the Web as part of their work engage in 
four complementary modes of undirected viewing, 
conditioned viewing, informal search, and formal search. 
Moreover, each mode is characterized by information 
seeking moves based on recurrent sequences of browser 
functions and features. Based on these findings, a model 
of Web use is presented, as well as some suggestions for 
supporting and enhancing Web information seeking in 
organizations. 



1. Introduction 

The objective of this paper is to present field research 
on how managers and professionals use the Web as a new 
information technology to obtain information that helps 
them carry out their daily work. In terms of its potential, 
the Web stands to deliver a broad range of functions for 
content-delivery, communication, and collaboration that 
could benefit everyone in an organization (web pages, 
bulletin boards, chats, discussions, e-mail, file transfers, 
remote logins, and so on). However, as technology-in- 
use, little is known of how this tool is utilized in practice. 

To address this situation, this paper describes an in- 
depth field investigation of organizational Web use. The 
paper is organized into five sections. The first 
familiarizes the reader with recent research on Web 
information seeking which relies heavily on the 
collection and analysis of quantitative metrics. The 
second provides an overview of elements from 
information seeking research in the organizational 



science and information science literature which 
concentrates on understanding the context behind 
information seeking. The third describes the paper’s 
research design which utilizes both quantitative and 
qualitative measures in order to gain an understanding of 
both Web use metrics and the context behind 
organizational information seeking. The fourth presents 
results and findings in terms of a behavioral model of 
Web use. The fifth postulates, at an operational level, 
various ways of supporting and enhancing Web 
information seeking in organizations. 

2. Recent Web Information Seeking Studies 

Past studies have traditionally focused on the more 
quantitative aspects of Web information seeking activity. 
Catledge and Pitkow were among the first to publish a 
major study of Web browsing behavior by modifying the 
source code for a version of XMosaic, the dominant X 
Windows browser at the time [1], They configured the 
browser to generate a client-side log file that showed user 
navigation strategies and interface selections. They 
released this modified browser to Computer Science 
department students who ran Mosaic from X Terminals 
in the various departmental computing labs at Georgia 
Tech. Results were measured using a task-oriented 
method. They determined session boundaries by 
analyzing the time between each event for all events, and 
adopted the heuristic that a lapse of 25.5 minutes or 
greater indicated the end of a “session.” This heuristic is 
currently the most-commonly used for delimiting 
sessions. The study yielded some unexpected results. Web 
pages that users bookmarked did not match the most- 
popular sites visited as a whole from the group. Only 2% 
of Web pages were either saved locally or printed. These 
results may have been influenced by limitations in the 
browser (XMosaic’s bookmarking capabilities), or the 
availability of printers in the work place. Catledge and 
Pitkow also hypothesized that users in their study 
categorized as “browsers” spend less time on a Web page 
than “searchers.” 



1 Copyright 2000 IEEE, Published in the Proceedings of the Hawaii International Conference on System Sciences, 
January 4-7, 2000, Maui, Hawaii. 
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Tauscher and Greenberg focused on the history 
mechanisms that Web browsers use to manage recently- 
requested Web pages browsed in a session of Web 
information seeking [2, 3]. They also used a modified 
XMosaic browser to collect Web browsing data for over 
six weeks from 23 participants. They recorded and 
examined the rate that Web pages were visited; how 
users visited old and new Web pages; the distance (in 
terms of URLs) between repeated Web page visits; the 
frequency of Web page visits, the extent of browsing in 
one cluster of Web pages; and repeated sequences of 
path-following behavior. Most significantly, they found 
that 58% of the pages visited during a Web browsing 
session were re-visits. This seems to suggest that Web 
information seeking may be influenced by Web browser 
functionality that makes it easy to go back to recently 
viewed pages. Tauscher and Greenberg showed that 
overall, users also only access a few pages frequently 
(60% once, and 19% twice) and browse in very small 
clusters of pages. They contend that Web browsing 
activity is a recurring system where users predominantly 
repeat activities they had invoked before, while still 
selecting new actions from the many that are possible. 
People explained that they revisited Web pages because 
the information contained by them changes; they wish to 
explore the page further; the page has a special purpose 
(e.g. search engine, home page); they are authoring a 
page; or the page is on a path to another revisited page. 
Thus, Tauscher and Greenberg identified seven Web 
browsing patterns: first-time visits to a cluster of pages; 
revisits to pages; page authoring (where the subject used 
Reload to view the newly modified page); use of web- 
based applications; hub-and-spoke visits (navigating to 
each new page from around a central page); a guided tour 
where links guide navigation through the Web pages; 
and a depth-first search where link paths are followed 
without returning to the first page in some cases. 

More recently, Huberman, Pirolli, Pitkow and Lukose 
discovered several strong regularities of Web user surfing 
patterns and developed a mathematical law of surfing 
that determines the probability distribution of the number 
of pages a user visits within a Web site [4]. They started 
with a model of probability of the number of links a user 
might follow on a Web site. Next they calculated a value 
for the current page and related this value to the next 
page accessed that leads to examining the cost of 
continuing surfing. When the cost of moving to the next 
Web page is more than its expected value, the user stops 
Web surfing. They analyzed data collected from a sample 
of AOL (America Online) users for each of five days, a 
huge amount of data. One day alone (December 5, 1997) 
yielded 23,692 AOL users who collectively surfed 
3,247,054 Web pages from 1,090,168 unique Web pages. 



This amount of data is staggering compared to previous 
studies of Web use. 

In a related study, Huberman et. al. examined Web 
server logs of the Xerox external Web site in order to 
obtain a constrained set of Web page requests. They used 
“cookies’' to help track the paths of individual users as 
they surfed through the Web site. Generally, they found a 
“strong fit” which was consistent through each day of the 
study. By applying this model along with a spreading 
activation algorithm, they could predict the number of 
requests for each Web page in a Web site. As they point 
out, this has implications for e-commerce applications 
and Web site organization, not to mention providing a 
more robust understanding of information seeking 
patterns on the Web. Overall, their study echoes other 
research in suggesting that surfing patterns on the Web 
display strong statistical regularities that can be described 
by a universal law. In addition, the success of the model 
points to the existence of utility maximizing behavior 
underlying surfing. 

3. Information Seeking in Organizations 

As described, recent research in Web use activity has 
focused on the more quantitative aspects of Web 
information seeking. We suggest that a fuller picture 
would emerge if quantitative analysis were 
complemented with qualitative analysis of the context in 
which Web information seeking was situated (addressing 
questions such as why information was needed, and how 
the information was used). Past research from 
organizational science and information science may offer 
help in this area. Specifically, we turn our attention to 
four modes of organizational scanning or external 
information seeking from organization science. This is 
followed by a description of six categories of information 
seeking moves derived from empirical research in 
information science. 

3.1. Organizational Scanning Modes 

The initial field work of Aguilar and the subsequent 
theoretical elaboration by Weick and Daft suggest that 
organizations scan or seek external information in four 
distinct modes: undirected viewing, conditioned viewing, 
informal search, formal search [5, 6, 7]. In undirected 
viewing, the individual is exposed to information with no 
specific informational need in mind. The overall purpose 
is to scan broadly in order to detect signals of change 
early. Many and varied sources of information are used, 
and large amounts of information are screened. The 
granularity of information is coarse, but large chunks of 
information are quickly dropped from attention. The goal 
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of broad scanning implies the use of a large number of 
different sources and different types of sources. 

In conditioned viewing, the individual directs viewing 
to information about selected topics or to certain types of 
information. The overall purpose is to evaluate the 
significance of the information encountered in order to 
assess the general nature of the impact on the 
organization. The individual has isolated a number of 
areas of potential concern from undirected viewing, and 
is now sensitized to assess the significance of 
developments in those areas. 

During informal search, the individual actively looks 
for information to deepen the knowledge and 
understanding of a specific issue. It is informal in that it 
involves a relatively limited and unstructured effort. The 
overall purpose is to gather information to elaborate an 
issue so as to determine the need for action by the 
organization. 

During formal search, the individual makes a 
deliberate or planned effort to obtain specific information 
or types of information about a particular issue. Search is 
formal because it is structured according to some pre- 
established procedure or methodology. The granularity of 
information is fine, as search is relatively focused to find 
detailed information. The overall purpose is to 
systematically retrieve information relevant to an issue as 
a basis for developing a decision or course of action. 

3.2. Information Seeking Behaviors 

Based on their field research of information seeking 
by social scientists, research physicists and chemists, and 
engineers and research scientists in an industrial firm, 
Ellis et al develop a model that identified six generic 
categories of information seeking activities: starting, 
chaining, browsing, differentiating, monitoring, and 
extracting [8, 9, 10]. 

“Starting” comprises those activities that form the 
initial search for information — identifying sources of 
interest that could serve as starting points of the search. 
Initial sources often point to, suggest, or recommend 
additional references. Following up on these new leads 
from an initial source is the activity of “chaining”. 
Chaining can be backward or forward. Backward 
chaining takes place when pointers or references from an 
initial source are followed, whereas forward chaining 
identifies and follows up on other sources that refer to an 
initial source. Although it can be an effective way of 
broadening a search, forward chaining is much less 
commonly used. 

Having located sources and documents, “browsing” is 
the activity of semi-directed search in areas of potential 
search. The individual often simplifies browsing by 



looking through tables of contents, lists of titles, subject 
headings, names of organizations or persons, abstracts 
and summaries, and so on. During “differentiating”, the 
individual filters and selects from among the sources 
scanned by noticing differences between the nature and 
quality of the information offered. The differentiation 
process is likely to depend on the individual’s 
disciplinary background, prior or initial experiences with 
sources, word-of-mouth recommendations, or reviews in 
published sources. 

“Monitoring” is the activity of keeping abreast of 
developments in an area by regularly following particular 
sources. The individual monitors by concentrating on a 
small number of what are perceived to be core sources. 
“Extracting” is the activity of systematically working 
through a particular source or sources in order to identify 
material of interest. Extracting tends to be labor 
intensive, and is more likely when there is a need for 
comprehensive or historical information on a topic. 

Most of the information seeking behavior categories in 
Ellis’ model are already being supported by capabilities 
available in common Web browser software. Thus, an 
individual could begin surfing the Web from one of a few 
favorite starting pages or sites (starting); follow links to 
related information resources — in both backward and 
forward linking directions (chaining); scan the Web 
pages of the sources selected (browsing); bookmark 
useful sources for future reference and visits 
(differentiating); subscribe to e-mail based services that 
alert the user of new information or developments 
(monitoring); and search a particular source or site for all 
information on that site on a particular topic (extracting). 

4. Data Collection & Analysis 

Recall that the objective of this paper to investigate 
and understand how managers and professionals use the 
Web as a new information technology to obtain 
information that helps them carry out their daily work. 
To facilitate this, we present a research methodology that 
combines both quantitative and qualitative measures. We 
believe that doing so can allow us to gain a fuller picture 
of organizational Web use than recent Web information 
seeking studies. Specifically, our research design 
combines quantitative analysis of Web tracking logs with 
qualitative analysis of personal interviews which probe 
the context in which Web information seeking is situated. 
To frame the analysis of this rich data set, we utilize both 
Aguilar’s modes of scanning and Ellis’ seeking behaviors 
to produce a behavioral model of organizational 
information seeking on the Web. 
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4.1. Participants 

Thirty four participants from seven companies took 
part in the study. Since participants who regularly use the 
Web as part of their daily work were preferred, 
volunteers were canvassed through invitations at various 
IT-related workshops and conferences; postings at 
technology-focused listservs; and direct e-mail contact 
with colleagues and associates at large technology- 
oriented companies. 

The seven companies comprised a major national 
bank, a large utility company, a national magazine 
publisher; a medium-sized University research library, a 
medium-sized marketing agency, and two small software 
consulting firms. The participants held jobs as IT 
technical specialists or analysts; managers; researchers; 
marketing staff; consultants; and administrative staff. 

All of the users in this study primarily utilized the 
Web for business purposes as an integral part of their 
work responsibilities and activities. In most cases, 
participants were connected to the Internet through 
continuous leased-line access and used relatively high- 
powered machines. Many of the participants would be 
generally regarded as technically proficient Web users. 

4.2. Data Collection 

Three methods of data collection were employed: 
questionnaire survey; a WebTracker application that 
recorded Web browser actions; and personal interviews 
with participants. 

The questionnaire survey was administered at the 
participants’ work places, during the first site visit. The 
survey contained 12 questions that identified the 
information sources the participants used, their frequency 
of using these sources, and their perception of the 
perceived accessibility and quality of each of the sources. 
A wide range of sources was covered, including personal 
and impersonal sources (print and electronic), as well as 
internal and external sources. There were also questions 
on the amount of time and frequency of using the Web 
for information seeking. Furthermore, through informal 
conversations during the visit, research team members 
were able to develop a general impression of the style and 
scope of each participant’s Web use. 

The custom-developed WebTracker application was 
installed on each participant’s computer, and it ran 
transparently whenever the participant’s Web browser 
was being used. The WebTracker application was left to 
run on participants’ computers for two- week periods. 
Because the WebTracker was essentially ‘invisible,’ it 
was not expected to influence participants’ normal Web- 
use behaviors. After two weeks, WebTracker was 



uninstalled, and the WebTracker log file collected for 
analysis. The WebTracker recorded how each participant 
was using the browser to navigate the Web and 
manipulate information from the Web. Specifically, it 
recorded all URL calls and requests, as well as most 
browser menu selections, and wrote these events into a 
local log file on each participant’s hard disk. Browser 
menu selections captured included “Open URL or File,” 
“Reload,” “Back,” “Forward,” “Add to Bookmarks,” “Go 
to Bookmark,” “Print,” and “Stop.” Because all URL 
calls and menu selections were date-time stamped as they 
were written into the WebTracker log, the research team 
was able to subsequently reconstruct move-by-move how 
participants looked for information on the Web during 
particular episodes. 

The WebTracker log was pre-analyzed to prepare for 
personal interviews with each participant. The interview 
format was based on the principles of the Critical 
Incident Technique, in which the ‘incident’ to be studied 
should be recent, sufficiently complete, and its effects or 
consequences sufficiently clear [11]. In the interviews, 
participants described two ‘critical incidents’ of Web 
information seeking and use in reply to the following 
question: 

“Please try to recall a recent instance in which you 
found important information on the Web, information 
that led to some significant action or decision. Would you 
please describe that incident for me in enough detail so 
that I can visualize the situation?” 

Where appropriate, participants were prompted with 
the names of Web sites that were indicated in their 
WebTracker log files. Besides ‘critical incidents,’ 
participants were also invited to comment more broadly 
on their use of the Web, including their general Web-use 
strategies and preferences, as well as what they perceived 
to be positive and negative aspects of Web use. 

4.3. Data Analysis 

Data analysis was completed in two passes. The 
purpose of the first pass was to identify episodes of 
significant information seeking and to classify each 
episode according to the four modes of information 
seeking described earlier in section 3.1. The purpose of 
the second pass was to identify the browser activities that 
best characterized each episode of information seeking. 

In Pass 1, significant episodes of Web-based 
information seeking were identified from the personal 
interview transcripts as well as the WebTracker logs. 
During interviews, participants were asked to recall 
“critical incidents” or significant episodes of finding and 
using information on the Web. By reading the 
transcripts, each episode was analyzed according to its 
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information need, amount of effort, number of web 
sources consulted, and information use. Based on this 
analysis, an episode would be categorized as one of the 
four modes of scanning (undirected / conditioned 
viewing; informal / formal searching). WebTracker logs 
were also examined to identify additional significant 
episodes. Two criteria were used to select episodes: the 
episode consumed a substantial amount of time and 
effort; or the episode was a frequently or regularly 
repeated activity. 

Out of a total of 61 episodes identified, 12 were 
categorized as undirected viewing. The most common 
example of undirected viewing consisted of visits to 
general news websites such as those of NewsEdge, 
news.com, and newspapers. In the words of one 
participant, the goal was to “keep up with what’s 
happening in the world.” General news sites acted as 
gateways to information covering many different subject 
areas, and provided an efficient way of surveying current 
developments without a specific information need in 
mind. Other channels of undirected viewing included 
portal sites such as CANOE, and large magazine sites 
such as ZDnet. 

Eighteen episodes were categorized as conditioned 
viewing. The most common examples were regular 
return visits to bookmarked sites, and starting from a 
particular page that contained links to sites of interest. 
Thus, a number of participants regularly visited the 
websites of Microsoft, Novell, and Sun Microsystems in 
order to monitor new content in selected sections. One 
participant regularly visited the Novell site for 
information on upcoming training courses, seminars, and 
software updates. Another returned to Sun’s Java home 
page periodically to follow developments in the 
Electronic Commerce Framework and E-commerce tools. 
A third person habitually scanned the Canada Newswire 
Site to view press releases from the Federal and 
Provincial governments. Yet another customized his 
start-page at MSN with his own topic headings and 
keywords. 

Twenty three episodes were categorized as informal 
search, and these constituted the largest group. The most 
common examples of informal search were when 
participants made use of specific query terms such as 
names of companies, products or technologies to perform 
simple searches on easily accessible search engines. 
There were several examples of selecting search engines 
that were local to a specific site (e.g. a search engine 
maintained by a company that only indexed its own web 
pages). Thus, two participants used the local search 
engine on the website of Forrester Research (a market 
research firm) to retrieve information about specific 
companies; another participant used the search engine at 
the Environmental Protection Agency to retrieve 



information on ventilation-heating systems for school 
buildings. Several of the informal searches used well- 
known search services from Yahoo and AltaVista. 

Eight episodes were categorized as formal search. 
Here, participants were intending to use the information 
formally (e.g. to write policy or planning documents, to 
provide definitions). Three formal searches utilized 
several search engines, including meta search services. 
Two searches attempted to be exhaustively 
comprehensive: one used four meta search engines to 
locate a good example of an action plan that could be 
formally presented to a manager; the other used the 
DejaNews search engine to retrieve two author profiles 
and scan all their postings. Another search was carried 
out over four days, retrieving high quality resources on 
Women Advocacy to be included on an institutional site 
for International Women and Human Rights. 

For each of the significant information seeking 
episodes categorized in Pass 1, the corresponding 
sections of the WebTracker log were analyzed in a 
second pass to determine the browser-based actions that 
best characterized each episode. 

The WebTracker log files were tabulated into large 
spreadsheets with entries arranged in chronological 
sequence. Each entry contained a date-time value, 
followed by a URL or a browser menu action name. Thus 
it was possible to examine the information seeking moves 
in chronological order in each of the 61 episodes. Data 
about the sequence of site visits, repetitions of these 
sequences, movements backwards and forwards between 
pages, the use of bookmarking, the selection of sites from 
stored bookmarks, the use of search engines, printing, 
and other actions and events captured by the WebTracker 
were examined to trace the selection and development of 
information seeking moves over the duration of each 
episode. Using the criteria presented earlier based on 
Ellis’ model in section 3.2, information seeking moves 
were analyzed to infer whether moves may be categorized 
as starting, chaining, browsing, differentiating, 
monitoring, or extracting. 

The most common examples of starting moves took 
the form of participants starting their Web sessions from 
(1) jumpsites that contained links of interest; (2) portal 
sites; and (3) Intranet entry pages of their organizations. 
Chaining moves occurred when participants followed 
links from the starting page or some other page. 
Chaining could be in either direction 
(backward/forward). Browsing moves occurred when 
participants looked through top-level pages, examined 
lists of headings, or viewed sitemaps. Differentiating 
moves were when participants bookmarked a page, 
printed it, or copied its contents. Another indication of 
differentiating was when a person went direcly to a 
specific site of known content (e.g. the Microsoft site) by 
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entering its URL. Monitoring moves were when 
participants revisited favorite sites (that have for example 
been bookmarked or entered into a customized list/page). 
Although this was uncommon, another indication would 
be when participants signed up for email or alert services 
that informed them of new content on the monitored 
pages. Extracting moves were characterized by 
participants systematically working through a website to 
extract information of interest. A common method of 
extracting was to use local search engines that indexed 
material at their parent sites. 

5. Results and Discussion 

Overall, sixty-one episodes of ‘significant’ 

information seeking were identified and categorized 
according to their information modes. The majority of 
the episodes were classified as informal search (23) and 
conditioned viewing modes (18). A smaller number of 



episodes were undirected viewing (12) and formal search 
(8). From there, the episodes in each mode were 
examined in terms of their Web moves. Figure 1 
illustrates how the episodes were classified across Web 
modes and moves in an empirical model of Web use. 

In the undirected viewing episodes, data collected by 
the WebTracker application indicated that the most 
frequently occurring moves were starting and chaining. 
Thus, participants began at favorite starting pages (news 
or portal sites) and followed links that they found 
interesting on those pages. This was usually 
characterized by repetitive movement back and forth 
using the starting page as anchor. 

One would expect to see many instances of starting 
and chaining in the undirected viewing mode on the 
Web. Starting occurs when viewers begin their web use 
on pre- 
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Figure 1. An Empirical Model of Web Use 



selected default home pages, or when they visit a favorite 
page or site to begin their viewing (such as news, 
newspaper, or magazine sites). Chaining occurs when 
viewers notice items of interest (often by chance), and 
then follow hypertext links to more information on those 
items. Forward chaining of the sort just described is the 
most typical during undirected viewing. Backward 
chaining is also possible, since search engines can be 
used to locate other Web pages that point to the site that 
the user is currently at. 

In the conditioned viewing episodes, the most 
frequently occurring moves were differentiating, 
browsing, and monitoring. Thus, participants selected a 



bookmarked page/site, or entered the URL of a site they 
remembered (differentiating). Another example of 
differentiating was when participants printed useful 
pages for their own files or to show to others. These 
sites/pages were then examined to locate new content of 
interest (browsing). The most important characteristic of 
conditioned viewing was that participants regularly or 
frequently returned to their selected or differentiated 
sites/pages to check for new information (monitoring). 

One would expect browsing, differentiating, and 
monitoring to be common in the conditioned viewing 
mode on the Web. Differentiating occurs as viewers 
select Web sites or pages that they expect to provide 
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relevant information. Sites may be differentiated based 
on prior personal visits, or recommendations by others 
(such as word-of-mouth or published reviews). 
Differentiated sites are often bookmarked. When visiting 
differentiated sites, viewers browse the content by 
looking through tables of contents, site maps, or list of 
items and categories. Viewers may also monitor highly 
differentiated sites by returning regularly to browse, or by 
keeping abreast of new content (through, for example 
subscribing to newsletters that report new material on the 
site). 

In the informal search episodes, the most frequently 
occurring moves were differentiating and localized 
extracting. Thus, participants went directly to selected 
sites where they expected that the searching they intend 
to do would likely yield results, e.g. going to a market 
research firm’s site to search for company data, or to a 
software vendor site to search for software patches 
(differentiating). Searching at these sites would make use 
of the local search engines that were dedicated to 
retrieving information from those sites (localized 
extracting). Some participants frequently returned to 
specific sites to perform their informal searches 
(monitoring). 

One would expect differentiating, extracting, and 
monitoring to be typical during informal search on the 
Web. Again, informal search is likely to be attempted at a 
small number of Web sites that have been differentiated 
by the individual, based on the individual’s knowledge 
about these sites’ information relevance, quality, 
affiliation, dependability, and so on. Extracting is 
relatively “informal” in the sense that searching would be 
localized to looking for information within the selected 
site(s). Extracting is also likely to make use of the basic, 
‘simple’ search features or commands of the local search 
engine, in order to get at the most important or most 
recent information, without attempting to be 
comprehensive. Monitoring becomes more proactive if 
the individual sets up push channels or software agents 
that automatically find and deliver information based on 
keywords or subject headings. 

In the formal search episodes, the most frequently 
occurring move was a relatively intensive and thorough 
form of extracting, compared with the localized 
extracting that characterized informal searching. Thus, 
participants systematically worked through a number of 
search engines or meta search engines so as to find (all) 
important information about a topic or item. Formal 
searches often involved the use of search engines known 
for their comprehensive coverage, currency, or the 
inclusion of historical data. 

One would expect primarily extracting operations 
with some complementary monitoring activity during 
formal search on the Web, however for this group of 



participants there were no explicit instances of 
monitoring to support extracting. Formal search makes 
use of search engines that cover the Web relatively 
comprehensively, and that provide a powerful set of 
search features that can focus retrieval. Because the 
individual wishes not to miss any important information, 
there is a willingness to spend more time in the search, to 
learn and use complex search features, and to evaluate 
the sources that are found in terms of quality or accuracy. 
Formal search may be two-staged: multi-site searching 
that identifies significant sources is then followed by 
within-site searching. Within-site searching may involve 
fairly intensive foraging. Extracting may be supported by 
monitoring activity, again through services such as Web 
site alerts, push channels/agents, and e-mail 
announcements, in order to keep up with late-breaking 
information. 

The distribution of information seeking episodes 
identified in the Web tracking data suggests that people 
who use the Web as part of their work engage in four 
complementary modes of information seeking. Each 
mode is set apart by its information needs, information 
seeking scope and effort, and the purpose of information 
use. Moreover, each mode of information seeking is 
characterized by information seeking moves that were 
revealed through recurrent sequences of participants’ use 
of browser functions and features. Undirected viewing 
was mainly characterized by starting and chaining; 
conditioned viewing by differentiating, browsing, and 
monitoring; informal search by differentiating, and 
localized extracting; and formal search by systematic, 
thorough extracting. 

The study also operationalizes and measures the six 
categories of information seeking behaviors identified by 
Ellis as browser-based actions and events [8, 9, 10]. 
Recurrent patterns of these actions would indicate that a 
user is working in a particular mode of viewing or 
searching on the Web. For example, repeated sequences 
of starting and chaining might suggest undirected 
viewing (moving back and forth visiting links on a 
starting page); while sequences of differentiating and 
extracting might suggest informal search (going to a 
bookmarked site and doing a local search). 

Two other observations can be made. The first 
concerns monitoring which is keeping up in an area by 
regularly following particular core or important sources. 
Two forms of monitoring are possible on the Web: “pull” 
monitoring is when a user selects a bookmark or enters a 
URL to revisit a site; “push” monitoring is when a user 
automatically receives alerts that a monitored site has 
been updated. Common methods of push monitoring on 
the Web include subscribing to email newsletters or alerts 
from the monitored site; setting up a personalized profile 
or channel; and subscribing to services that track content 



changes on selected sites. Although most participants in 
this study would be considered Web-savvy, very few 
made use of push monitoring techniques: one did use an 
email alert service; three others tried out a push service, 
but only for a limited time. 

The second observation concerns extracting. On the 
Web, extracting is systematically searching through one 
or more sites in order to locate information of interest at 
those sites. In this study, most episodes of extracting 
employed basic searching strategies. For the most part, 
search formulations were relatively simple, with 
advanced features such as Boolean operators, and word 
truncation or proximity operators rarely utilized. This 
was the case even when participants appeared to be 
working in the formal search mode. There were no 
instances of participants accessing search-engine help 
instruction pages to improve their searches. 

6. Implications 

The research presented here focuses on people who 
use the Web to support their daily work activities. They 
engage in a range of complementary modes of 
information seeking, varying from undirected viewing 



that does not pursue a specific information need, to 
formal searching that retrieves specific information for 
action or decision making. Each mode of information 
seeking on the Web is distinguished by recurrent 
sequences of browser actions initiated by the information 
seeker. Thus, undirected viewing is characterized by 
starting and chaining actions; conditioned viewing is 
characterized by differentiating, browsing, and 
monitoring; informal search is characterized by 
differentiating and localized extracting; and formal 
search consists of systematic, thorough extracting. 

This research has gathered empirical data to deliver a 
Web use model based on information seeking modes and 
moves. By understanding how organizational participants 
actually utilize the Web in work practice, we may 
postulate, at an operational level, ways of supporting and 
enhancing the information seeking of Web users as they 
engage in the four modes of information seeking 
portrayed in this study. Example suggestions are shown 
in Table 1 below. 



Table 1. Suggestions for Enhancing Organizational Web Use 



Web Use Mode 


Enhancing Web Use 


Undirected viewing: 
starting and chaining 


• Introduce systems that can search or recommend new jumpsites or "similar-to" sites 

• Encourage people in a group to share bookmarks, Web pages, URLs 

• Design corporate portals to support undirected, serendipitous viewing 


Conditioned viewing: 
browsing, differentiating, 
monitoring 


• Train users to evaluate and escalate priority or importance of information 

• Make it easy for users to share Web-based information via email or in online forums 

• Introduce users to services that allow them to subscribe to and be notified about new 
content on Web pages 


Informal search: 
differentiating, monitoring, 
extracting 


• Pre-select high quality sources and search engines for quick, informal searches 

• Prepackage good search strategies developed by subject matter experts; allow users 
to view these strategies and learn from them 

• Educate users on how to evaluate information provenance and quality 


Formal search: 
extracting 


• Educate users about full range of information sources that should be considered for 
comprehensive search: print, online, human sources 

• Educate users about when to use: commercial online database services; the library o 
information resource center; information brokers/professionals 

• Train users on advanced search techniques: narrowing or broadening a search; 
balancing precision and recall; backward and forward chaining. 
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